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WHAT DO WE LEARN FROM IRAC OBSERVATIONS OF GALAXIES AT 2 < Z < 3.5? 1 
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ABSTRACT 

We analyze very deep HST, VLT and Spitzer photometry of galaxies at 2 < z < 3.5 in the Hubble 
Deep Field South. The sample is selected from the deepest public K-band imaging currently available. 
We show that the rest-frame U — V vs V — J color-color diagram is a powerful diagnostic of the stellar 
populations of distant galaxies. Galaxies with red rest-frame U — V colors are generally red in rest- 
frame V — J as well. However, at a given U — V color a range in V — J colors exists, and we show that 
this allows us to distinguish young, dusty galaxies from old, passively evolving galaxies. We quantify 
the effects of IRAC photometry on estimates of masses, ages, and the dust content of z > 2 galaxies. 
The estimated distributions of these properties do not change significantly when adding IRAC data 
to the UBVIJHK photometry. However, for individual galaxies the addition of IRAC can improve 
the constraints on the stellar populations, especially for red galaxies: uncertainties in stellar mass 
decrease by a factor of 2.7 for red ((£/ — V) res t > 1) galaxies, but only by a factor of 1.3 for blue 
((17— V) res t < 1) galaxies. We find a similar color-dependence of the improvement for estimates of age 
and dust extinction. In addition, the improvement from adding IRAC depends on the availability of 
full near-infrared JHK coverage; if only K-band were available, the mass uncertainties of blue galaxies 
would decrease by a more substantial factor 1.9. Finally, we find that a trend of galaxy color with 
stellar mass is already present at z > 2. The most massive galaxies at high redshift have red rest- 
frame U — V colors compared to lower mass galaxies even when allowing for complex star formation 
histories. 

Subject headings: galaxies: evolution - galaxies: high redshift - infrared: galaxies 



1. INTRODUCTION 

Two of the major challenges in observational cosmol- 
ogy are understanding the history of star formation 
in galaxies, and understanding the assembly of mass 
through cosmic time. In the local universe elaborate 
surveys mapped the diversity of nearby galaxies (e.g. 
Blanton et al. 2003) and characterized the dependence 
of their colors (Baldry et al. 2004) and star formation 
(Kauffrnann et al. 2003) on galaxy mass. The study of 
their progenitors at z>2 is important since it is believed 
that at this epoch the most massive galaxies formed their 
stars (Glazebrook et al. 2004; van der Wei et al. 2005; 
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Rudnick et al. 2006). 

The first method to efficiently identify distant galaxies 
was the Lyman-break technique (Steidel et al. 1996). 
Large samples have been spectroscopically confirmed 
(Steidel et al. 1999). Their stellar populations have been 
characterized by means of broad-band photometry (e.g. 
Papovich, Dickinson, & Ferguson 2001, Shapley et al. 
2005), optical spectroscopy (e.g. Shapley et al. 2003) 
and near-infrared spectroscopy (Erb et al. 2003, 2006). 
Lyman break galaxies (LBGs) have spectral energy dis- 
tributions similar to nearby starburst galaxies. 

In recent years, new selection criteria provided evi- 
dence for a variety in color space among high redshift 
galaxies as rich as in the local universe. Among the 
newly discovered populations are submm galaxies (e.g. 
Smail et al. 2004), "IRAC Extremely Red Objects" 
(IEROs; Yan et al. 2004), "BzK" objects (Daddi et 
al. 2004) and distant red galaxies (DRGs; Franx et al. 
2003). The latter are selected by the simple color crite- 
rion (J — K)vega > 2.3. Their rest-frame UV-to-optical 
SEDs resemble those of normal nearby galaxies of type 
Sbc-Scd (Forster Schreiber et al. 2004). Near-infrared 
spectroscopy of DRGs (Kriek et al. 2006) and exten- 
sion of the broad-band photometry to mid-infrared wave- 
lengths (Labbe et al. 2005) suggests that evolved stellar 
populations exist already at 2 < z < 3.5. Rudnick et al. 
(2006) showed that DRGs contribute significantly to the 
mass density in rest-frame optically luminous galaxies, 
van Dokkum et al. (2006) studied a stellar mass-limited 
sample of galaxies with M > 10 11 M Q and found that 
DRGs, rather than LBGs, are the dominant population 
at the high mass end at 2 < z < 3. 
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In this paper, we exploit the 3 — 8fj,m imaging of the 
Hubble Deep Field South by Spitzer/IRAC (Fazio et al. 
2004) to extend the SED analysis of distant galaxies to 
the rest-frame near-infrared and constrain their stellar 
masses and stellar populations. Our sample is complete 
to K to t,AB — 25. No color selection criteria are applied. 
The depth of our imaging allows us to probe down to 
stellar masses of a few 10 9 M Q . We investigate whether 
IRAC helps to study the diversity of galaxies at high 
redshift and if the addition of IRAC improves the con- 
straints on stellar mass, age and dust content. Finally, 
we investigate the dependence of galaxy color on stellar 
mass. 

The paper is structured as follows. In f^wc describe 
the data, IRAC photometry and sample definition. 
explains the modeling of spectral energy distributions 
(SEDs). The rest-frame optical to near-infrared color 
distribution of our K-selected sample is discussed in S0I 
^provides an in depth discussion of the constraints that 
IRAC places on estimates of age, dust extinction and 
stellar mass. First wavelength and model dependence 
are discussed from a theoretical perspective. Next we 
discuss results from applying the models to our U-to- 
8/7777 spectral energy distributions. In SjID we investigate 
the rest-frame optical colors of high redshift galaxies as 
a function of stellar mass. Finally, the conclusions are 
summarized in jJJ| 

Throughout the paper we adopt a cosmology with 
H = 70 km s" 1 Mpc' 1 , Q, m = 0.3, and fl A = 0.7. 

2. DATA, PHOTOMETRY AND SAMPLE SELECTION 
2.1. Data 

Observations of the HDFS/WFPC2 field were obtained 
with the IRAC camera (Fazio et al. 2004) on the Spitzer 
Space Telescope (Werner et al. 2004) in June 2004 and 
June 2005 (GTO program 214). A 5' x 5' field of view 
was covered by the 4 broadband filters at 3.6, 4.5, 5.8 
and 8 microns. The data, reduction and photometry 
will be described in detail by Labbe et al. (in prepa- 
ration). Briefly, we started with the Basic Calibrated 
Data (BCD) as provided by the Spitzer Science Center 
pipeline. We applied a series of procedures to reject cos- 
mic rays and remove artifacts such as column pulldown, 
muxbleed, and the "first frame effect" (Hora et al. 2004). 
Finally, the frames were registered to and projected on 
a 2x2 blocked (0'.'2396 pixel scale) version of an exist- 
ing ISAAC K-band image (Labbe et al. 2003, hereafter 
L03) 1 , and average-combined. Characteristics such as 
exposure time, FWHM, limiting depth (5a, 3" diameter 
aperture) and positional accuracy in each of the 4 IRAC 
bands are summarized in Table ^ A summary of the 
optical-to-NIR observations by L03 is provided in Table 
13 All magnitudes quoted in this paper are in the AB 
system. 

2.2. Photometry 

In this section we describe the steps to combine the 
IRAC data and optical-to-NIR data (L03) into one con- 
sistent K-band selected photometric catalog. In this pa- 
per we limit ourselves to the 2.5' x 2.5' field where very 

1 NIR data from the FIRES survey of the HDFS is publicly 
available from http://www.strw.lcidcnuniv.nl/"' fires 




Fig. 1. — Postage stamps (9'.' 8 X 9'.' 8) illustrating the deblending 
procedure for IRAC photometry. Confusion by nearby neighbors in 
the original 3.6fim image (a) is reduced using the higher resolution 
K-band image (b) and its SExtractor segmentation map (c). A 
model 3.677m image (d) is created using information on position 
and extent of the galaxies from the K-band image. The model of 
the nearby neighbors (e) is subtracted from the original image to 
obtain a cleaned 3.6/xm image (f). 



deep K-band data is available from L03. The main chal- 
lenge in doing IRAC photometry is a proper treatment 
of source confusion and PSF matching of the data. In- 
tegrating for nearly 4 hours with IRAC at 3.6/oti and 
4.5/iTO reaches a depth only 1 mag shallower than 36 
hours of ISAAC K-band imaging (lOcr limit K tot AB = 
25), but the IRAC images have a 4 times broader PSF 
causing many sources to be blended. Information on the 
position and extent of K-band detected objects was used 
to fit and subtract the fluxes of neighbouring sources. 
Each K-band detected source was isolated using the SEx- 
tractor "segmentation map" and convolved individually 
to the considered IRAC PSF. Next all convolved sources 
were fitted to the IRAC image, leaving only their fluxes 
as free parameters. We subsequently subtract the best-fit 
fluxes of all neighboring sources to remove the contami- 
nation. An illustration of this measurement technique is 
presented in Fig^ The resulting cleaned IRAC images 
are matched to the broadest PSF (of the 8/xm image). 
We measured fluxes on the cleaned, PSF-matched images 
within a fixed 4'.'4 diameter circular aperture. The aper- 
ture size is a compromise between quality of PSF match- 
ing (within 3% as derived from dividing growthcurves) 
and adding too much noise. Finally we applied for each 
source an aperture correction to scale the IRAC fluxes 
to the "color" apertures defined for the K-band catalog 
by L03. The correction factor is the ratio of the original 
K-band flux in the color aperture and the K-band flux in 
the 8/1771 PSF matched image within a 4'.'4 diameter aper- 
ture. Photometric errors were calculated by taking the 
rms of fluxes in 4'.'4 diameter apertures on empty places 
in the IRAC image. The end product is a photometric 
catalog with consistent photometry from optical to MIR 
wavelengths with 11 filters (UBVIJHK+IRAC). 

2.3. Sample selection 

From the catalog described in A2.2I we selected all 
galaxies, well covered by all 11 filters, that have S/N > 
10 in the K-band. The sample reaches to a limiting total 
K-band magnitude K to t.AB = 25. 
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Table 1. Characteristics of the IRAC observations 



Filter 
W 


Exposure time 
(hr) 


FWHM 

(") 


Limiting depth 
(5a, 3" diameter aperture) 


Positional Accuracy a 
(") 


3.6 


3.76 


1.95 


25.6 


0.09 


4.5 


3.76 


1.90 


25.6 


0.15 


5.8 


3.76 


2.10 


23.4 


0.14 


8.0 


3.64 


2.15 


23.3 


0.11 



a The rms difference between bright star positions in IRAC and K-band image. 



Table 2. Characteristics of the optical-to-NIR observations (see L03) 



Instrument/Telescope Filter Exposure time FWHM Limiting depth 

(hr) (") (5cr, 0''7 diameter aperture) 



WFPC2/HST 


F300W 


36.8 


0.16 


27.8 


WFPC2/HST 


^450^ 


28.3 


0.14 


28.6 


WFPC2/HST 


F606W 


27.0 


0.13 


28.9 


WFPC2/HST 


F8UW 


31.2 


0.14 


28.3 


ISAAC/VLT 


Js 


33.6 


0.45 


26.9 


ISAAC/VLT 


H 


32.3 


0.48 


26.4 


ISAAC/VLT 


K a 


35.6 


0.46 


26.4 



Since spectroscopic redshifts are only available for 63 
out of 274 objects, we mostly rely on photometric redshift 
estimates to select high redshift galaxies and compute 
rest-frame colors and luminosities. The photometric red- 
shifts and derived rest-frame photometry were calculated 
as follows. We used an algorithm developed by Rudnick 
et al. (2001, 2003) to fit a nonnegative linear combination 
of galaxy templates to the spectral energy distribution of 
each galaxy. The template set consisted of empirical E, 
Sbc, Scd and Im templates from Coleman, Wu,& Weed- 
man (1980), the two least reddened starburst templates 
from Kinney et al. (1996) and two Bruzual & Char- 
lot (2003; BC03) single stellar populations (SSP) with a 
Salpeter (1955) stellar initial mass function (IMF), aged 
1 Gyr and 10 Myr respectively. The empirical templates 
were extended into the IR using the BC03 stellar popu- 
lation synthesis code. The derived photometric redshifts 
show a good agreement with the available spectroscopic 
redshifts. The average value of \z spec — z p h ot \j(\ + z spec ) 
is 0.06, 0.09 and 0.08 for galaxies at < z < 1, l<z<2 
and 2 < z < 3.5 respectively. 

Once the redshift was derived, we calculated rest- 
frame luminosities and colors by interpolating between 
observed bands using the best-fit templates as a guide. 
For a detailed description, we refer to Rudnick et al. 
(2003). 

The K-band selected sample contains 121 sources at 
< z < 1, 72 at 1 < z < 2 and 75 at 2 < z < 3.5. The 
K+IRAC photometry of the galaxies at 2 < z < 3.5 is 
provided as an electronic table (see Table El for a sample 
entry). In 21 we study the color-distribution of galaxies 
with Ly > 5 x 1O 9 L0 over the whole redshift range. 
From thereon, we will focus on the high redshift bin. 
Two commonly color-selected populations at z > 2 will 
be highlighted where of interest. Lyman- break galaxies 
(LBGs) are selected from the WFPC2 imaging using the 
criteria of Madau et al. (1996). Distant Red Galaxies 
(DRGs) are selected by the simple color criterion (J — 
K)ab > 1-34 (Franx et al. 2003). 



3. SED MODELING 

To study physical characteristics of the galaxies such 
as stellar mass, stellar age and amount of dust extinction, 
we make use of the evolutionary synthesis code developed 
by BC03. We fitted the synthetic spectra to our observed 
SEDs using the publicly available HYPERZ stellar pop- 
ulation fitting code, version 1.1 (Bolzonella et al. 2000). 
Redshifts were fixed to the z p hot measurement (see ^2.31 
Rudnick et al. 2003) or z spec when available. A mini- 
mum error of 0.08 mag was adopted to avoid that the 
data points with the largest errors are effectively ignored 
in the SED fits. We fitted three distinct star formation 
histories: a single stellar population (SSP) without dust, 
a constant star-formation (CSF) history with dust {Ay 
varying from to 4 in steps of 0.2) and an exponen- 
tially declining star-formation history with an e-folding 
timescale of 300 Myr (T300) and identical range of Ay 
values. The exponentially declining model allows for qui- 
escent systems that underwent a period of enhanced star 
formation in their past. Forster Schreiber et al. (2004) 
showed that the estimated extinction values do not vary 
monotonically with the e-folding timescale r, but reach 
a minimum around 300 Myr. Including the T300 model 
thus ensures that the allowed star formation histories en- 
compass the whole region of parameter space that would 
be occupied when fitting models with different values 
of t. For each of the star- formation histories (SFHs), 
we constrained the time elapsed since the onset of star 
formation to a minimum of 50 Myr, avoiding fit results 
with improbable young ages. The age of the universe at 
the observed redshift was set as an upper limit to the 
ages. Furthermore, we assume a Salpeter (1955) IMF 
with lower and upper mass cut-offs 0.1M Q and 100M Q , 
solar metallicity and we adopt a Calzetti et al. (2000) 
extinction law. For each object the star-formation his- 
tory resulting in the lowest x 2 of the fit was selected 
and corresponding model quantities such as age, mass 
and dust extinction were adopted as the best-fit value. 
We calculated the mass- weighted age for each galaxy by 
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Table 3. K+IRAC photometry of HDFS galaxies at 2 < z < 3.5 [exam- 
ple ENTRY OF ELECTRONIC TABLE] 



Object 3 


JK,tot 


Ik, coi 


J3.6fJ,rn,col 


j4:.5fj,7n,col 


j5.&jj,m,col f8.0fi7n,col 


176 


6.75 ±0.47 


5.07 ±0.17 


8.60 ±0.23 


10.68 ±0.25 


14.43 ± 1.74 9.30 ±2.06 



a Object identification number corresponds to that of the U-to-K catalog by Labbe et al. (2003). 

b Fluxes in total (tot) and color (col) aperture are scaled to an AB zcropoint of 25, i.e. mag^g = 25 — 2.5 log/. 



Table 4. Modeling results for HDFS galaxies at 2 < z < 3.5 [example 

ENTRY* OF ELECTRONIC TABLE] 



Object 3 


z SFH log(M„) A v 


\og(Age w ) 
(Gyr) 


176 


n vc +0.10 1n „+0.07 n c+0.4 
2 - 76 -0.34 r 300 10.77_ O 04 0.6_ 02 


-0 25+ - 26 



a Object identification number corresponds to that of the U-to-K catalog by Labbe et al. (2003). 



integrating over the different ages of SSPs that build up 
the SFH, weighting with their mass fraction. We use this 
measure since it is more robust with respect to degenera- 
cies in SFH than the time passed since the onset of star 
formation; it describes the age of the bulk of the stars. 
An electronic table, the contents of which are illustrated 
in Tabled summarizing the results of our SED modeling 
for the subsample of galaxies at 2 < z < 3.5 is available 
in the online Journal. In Fig. [21 we show example U-to- 
8/im SEDs with best-fit BC03 models of galaxies over 
the whole redshift range, illustrating that at all epochs a 
large variety of galaxy types is present. 

We fitted all objects in our sample twice, once with 
and once without IRAC photometry. We repeated the 
SED modeling with the same parameter settings using 
the models by Maraston et al. (2005). The results will 
be discussed in i)5.2.2l Variations in modeled parameters 
due to a different metallicity are addressed in ^5.2. 31 The 
effects of adopting a different extinction law are discussed 
in S15.2.4I Unless mentioned otherwise, we refer to stel- 
lar mass, mass- weighted age and dust extinction values 
derived from the U-to-8/im SEDs with BC03 models. 

4. REST-FRAME OPTICAL TO NEAR-INFRARED COLOR 
DISTRIBUTION 

At redshifts above 1 all rest-frame near-infrared bands 
have shifted redward of observed K and mid-infrared 
photometry is needed to compute rest-frame NIR fluxes 
from interpolation between observed bands. It has only 
been with the advent of IRAC on the Spitzer Space Tele- 
scope that the rest-frame near-infrared opened up for the 
study of high redshift galaxies. As the 3.6/im and A.5fim 
images are much deeper than the 5.8/im and 8.0/im im- 
ages (see Table ^| , we focus on the rest- frame J- band 

{.Jrest ) • 

Several studies have focussed on the optical to near- 
infrared colors and inferred stellar populations of par- 
ticular color-selected samples (e.g. Shapley et al. 2005, 
Labbe et al. 2005). In this section we take advantage 
of the multiwavelcngth data and the very deep K-band 
selection to study the rest-frame optical to near-infrared 
colors of all galaxies up to z — 3.5 without color bias. For 
the first time we can therefore investigate what range in 



optical to near-infrared colors high redshift galaxies oc- 
cupy, how their optical to near-infrared colors relate to 
pure optical colors, and what this tells us about the na- 
ture of their stellar populations. 

In Fig. |3 we present a color-color diagram of (U — 
V) rest versus (V — J) res t for the redshift bins < z < 1, 
1 < z < 2 and 2 < z < 3.5. A clear correlation of 
(U — V) res t with (V — J)rest is observed at all redshifts. 
The (U — V) res t color samples the Balmer/4000A break. 
The large wavelength range spanned by (U — V) res t and 
(V— J)rest together is useful to probe reddening by dust. 

To study how the color distribution compares to that 
in the local universe, we indicate the colors of galax- 
ies in the low-redshift New York University Value-Added 
Galaxy Catalog (NYU.VAGC, Blanton et al. 2005) with 
small grey dots. The low-z NYU.VAGC is a sample of 
nearly 50000 galaxies at 0.0033 < z < 0.05 extracted 
from the Sloan Digital Sky Survey (SDSS data release 
4, Adelman-McCarthy et al. 2006). It is designed to 
serve as a reliable reference for the local galaxy popula- 
tion and contains matches to the Two Micron All Sky 
Survey Point Source Catalog and Extended Source Cat- 
alog (2MASS, Cutri et al. 2000). Only the subsample 
of 20180 sources that are detected in the 2MASS J-band 
are plotted in Fig. [21 This results effectively in a reduc- 
tion of the blue peak of the bimodal U — V distribution. 
We only show those galaxies (both for the local sample 
and for our sample of HDFS galaxies) with a rest-frame 
V-band luminosity Lv > 5 x 10 9 L©. At this luminosity 
the distribution of low-z NYU.VAGC galaxies with SDSS 
and 2MASS detections starts falling off. From the much 
deeper HDFS imaging the luminosity cut weeds out low 
to intermediate redshift dwarf galaxies. 

The same trend of optically red galaxies being red in 
optical to near-infrared wavelengths that we found for 
galaxies up to z — 3.5 is observed in the local universe. 
However, there are two notable differences in the color 
distribution between distant and local galaxies. First, a 
population of luminous high redshift galaxies with very 
blue (U — V) res t and (V — J) res t exists without an abun- 
dant counterpart in the local universe. The 2MASS 
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Fig. 2. — The U-to-8/^m spectral energy distributions of a subset of galaxies occupying different locations in (U — V) r est vs (V — J)rest 
color-color space. Each row shows observed and BC03 model SEDs for galaxies with redshifts ranging from z ~ 0.7 to z ~ 3. A broad range 
of galaxy types is present at all redshifts. Galaxies with blue (U — V) res t colors (upper row) have young ages and a modest amount of dust 
obscuration. Objects with red (U — V) res t colors that are on the blue side of the (V — J) r est color distribution (middle row) are best fit 
by old stellar populations with little dust obscuration. The bottom row shows examples of galaxies with red optical and red optical-to-NIR 
colors. They are consistent with young stellar populations with a large dust reddening. 
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1O 9 L . SDSS+2MASS galaxies (small 
local reference. Color coding refers to 
with red U — V colors are also red in 
local SDSS galaxies the high redshift 
to bluer U — V colors (where Lyman- 
and for the same U — V color to redder 



observations are not deep enough to probe very blue 
V — J colors, but we can ascertain that 95% of all low- 
z NYU_VAGC sources with L v > 5 x 10 9 L Q lie in the 
range 0.73 < U-V < 2.24. About half of the blue galax- 
ies at z > 2 with (U - V) rest < 0.73 and L v > 5 x 10 9 L© 
satisfy the Lyman-break criterion. Their stellar popula- 
tions have been extensively studied (e.g. Papovich et al. 
2001, Shapley et al. 2001, among many others) and their 
blue SEDs (see e.g. object #242 and #807 in Fig.|U are 
found to be well described by relatively unobscured star 
formation. The rest-frame optical bluing with increas- 
ing redshift of galaxies down to a fixed Ly is thoroughly 
discussed by Rudnick et al. (2003). 

A second notable difference with respect to the color 
distribution of nearby galaxies is present at (U — V) res t > 
1, where most local galaxies reside. Our sample of 
HDFS galaxies has a median offset with respect to the 
SDSS+2MASS galaxies of 0.22 ± 0.04 mag towards red- 
der (V — J)rest at a given (U — V) res t- Furthermore, 
the spread in (V — J) r est is larger, extending from colors 
similar to that of local galaxies to (V — J) res t colors up 
to a magnitude redder. The larger spread in (V — J) re st 
colors at a given (U — V) res t is not caused by photomet- 
ric uncertainties. After subtraction in quadrature of the 
scatter expected from measurement errors (0.05 mag) we 
obtain an intrinsic scatter of 0.3 mag, significantly larger 
than that for SDSS+2MASS galaxies (0.19 mag) at a 
4.5cr level. 

In order to understand the nature of galaxies with sim- 
ilar or redder (V — J) res t than the bulk of nearby galax- 
ies, we make use of stellar population synthesis models 
by Bruzual & Chariot (2003; BC03). In Fig.glwe draw 
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Fig. 4. — Rest-Frame U—V versus V — J color-color diagram of all 
galaxies with Ly > 5 X 1O 9 L . SDSS+2MASS galaxies (small grey 
dots) are plotted as a local reference. The dust vector indicates an 
extinction of Ay = 1 mag. Color evolution tracks of (unreddened) 
Bruzual & Chariot (2003) models are overplotted: a simple stel- 
lar population (SSP, solid line), an exponentially declining (T300, 
dotted line) and constant (CSF, dashed line) star formation model. 
Galaxies are color-coded by best-fit mass-weighted age. The tracks 
show an increase towards redder U — V and slightly redder V — J 
with age. At a given U — V color redder than 1 galaxies that are 
red in V — J have the youngest best-fit mass-weighted ages. 



age tracks for three different dust-free star formation his- 
tories in the (U — V) rest vs (V — J) res t color-color dia- 
gram. The solid line represents a single stellar popula- 
tion (SSP), the dashed line a continuous star formation 
model (CSF) and the dotted line an exponentially de- 
clining star formation model with an e-folding timescale 
of 300 Myr (rsoo)- All star formation histories show an 
evolution to redder (U — V) res t and (V — J) r est with age. 
The T300 model first has similar colors as a CSF model 
and eventually moves to the same region in color space as 
an evolved SSP, namely where the red peak of the SDSS 
bimodal U — V distribution is located. In the absence of 
dust a population with a constant star formation history 
only reaches U — V = 1 in a Hubble time. 

We now investigate how the location in this color plane 
is related to stellar populations. Using the best-fit model 
parameters (see we plot the mass-weighted ages for 
the galaxies with Ly > 5 x 10 9 L© with color-coding on 
Fig. 2] Galaxies with blue optical colors are indeed found 
to be young, the median mass-weighted age for galaxies 
at (U - V) res t < 1 being 250 Myr. At (U - V) rest > 1 
galaxies with a wide range of stellar ages are found. The 
oldest stellar populations show the bluest (V — J) r est 
colors at a given (U — V) rest . Over the whole redshift 
range galaxies are present that have red optical colors 
and whose SEDs are consistent with evolved stellar pop- 
ulations and low dust content. According to their best-fit 
model, three of them started forming stars less than 0.5 
Gyr after the big bang and already at z > 2.5 have star 
formation rates less than a percent of the past-averaged 
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Fig. 5. — Rest-Frame U — V versus V — J color-color diagram of 
all galaxies with L v > 5 X 1O 9 L . SDSS+2MASS galaxies (small 
grey dots) are plotted as a local reference. The vector indicates a 
dust extinction of Ay = 1 mag. Galaxies are color-coded by best- 
fit Ay . The presence of dust moves galaxies to redder U — V and 
V — J colors. Galaxies falling redward in V — J of the distribution 
of local galaxies are best described by dusty stellar populations. 



value. We note that in the Chandra Deep Field South 
Papovich et al. (2006) find a number density of passively 
evolving galaxies at high redshift that is nearly an order 
of magnitude lower than in the HDFS, possibly owing to 
the fact that the HDFS observations probe to fainter K- 
band magnitudes. The red (V — J) r est side of the color 
distribution is made up of galaxies that are best fitted 
by young stellar populations. Since the age tracks alone 
cannot explain the presence of galaxies with such red 
SEDs from the optical throughout the near-infrared, we 
investigate the role of dust in shaping the galaxy color 
distribution. 

Fig. |5] shows again the (U — V) rest versus (V — J) r est 
color-color diagram, now color-coded by best-fit dust ex- 
tinction, expressed in Ay. The arrow indicates an Ay 
of 1 magnitude using a Calzetti et al. (2000) extinction 
law. It is immediately apparent that the optical to near- 
infrared color-color diagram is a useful diagnostic for dis- 
tinguishing stellar populations with various amounts of 
dust extinction. At the bluest (U — V) res t colors there is 
little evidence for dust obscuration. The degree of dust 
extinction increases as we move along the dust vector to 
redder colors. 

Independent constraints on dust-enshrouded activity 
in distant galaxies can be derived from MIPS 24/zm imag- 
ing (Webb et al. 2006; Papovich et al. 2006). The mid- 
infrared emission is usually thought to be powered by a 
dusty starburst in which PAH features are produced or 
by an active galactic nucleus (AGN). Ninety- five % of the 
area with very deep U-to-8/im in the HDFS is covered by 
a 1-hour MIPS pointing. We performed the same pho- 
tometric procedure to reduce confusion as for the IRAC 
photometry (see i)2.2f> . Fluxes were measured within a 
6" diameter aperture and then scaled to total using the 
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Fig. 6. — Rest-Frame U — V versus V — J color-color diagram 
of galaxies at 1.5 < z < 3 with L v > 5 X 1O 9 L with MIPS 24fim 
coverage. SDSS+2MASS galaxies (small grey dots) are plotted as 
a local reference. Filled circles represent MIPS 2Afim detections 
above a 28fiJy (3tr) threshold. #767 is detected at 2Afim and 
has an excess of 8/im flux compared to the best-fitting template 
SED, suggesting the presence of an obscured AGN. All other 24/im 
detections lie in the U — V , V — J region populated by galaxies with 
dusty stellar populations. Assuming the 24 micron flux originates 
from PAH emission produced by dust-enshrouded star formation, 
the MIPS observations confirm the diagnostic power of this color 
combination. 



growthcurve of the 24/im PSF. 

In Fig. we plot the (U — V) rest versus (V — J) res t 
color-color diagram of all objects in the redshift interval 
1.5 < z < 3.5 with Lv > 5 x 1O 9 L that are covered 
by MIPS (empty circles). At these redshifts strong PAH 
features, if present, move through the MIPS 24/im pass- 
band. Six sources have a MIPS 24/im detection above 
28 fiJy (3c). Their 24/im flux is indicated by the colored 
filled circles. Object #767 (red circle) is well detected 
with F 2 4 tim = 95/tJy. As noted by Labbe et al. (2005) 
its SED shows an 8/im excess with respect to the best- 
fitting template. The combination of 8/im excess and 
24/im detection suggests that this galaxy hosts an AGN 
whose power law SED dominates throughout the mid- 
infrared. All other 24/im detections are located in the 
part of the diagram where our U-to-8/im SED modeling 
found dusty young populations. None of the blue rel- 
atively unobscured star-forming galaxies or red evolved 
galaxies show evidence of PAH emission from the ob- 
served 24/im flux. There are various reasons why not all 
straforming dusty galaxies have a 24/im detection. The 
density of the UV radiation field exciting the PAHs may 
vary among galaxies. Furthermore, the narrow PAH fea- 
tures with respect to the width of the 24/im passband 
make the 24/im flux very sensitive to redshift. Overall, 
MIPS observations agree well with SED modeling and 
rest-frame optical-NIR color characterization. 

We conclude that over the whole redshift range from 
z = to z = 3.5 a trend is visible of galaxies with redder 
optical colors showing redder optical to near-infrared col- 
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Fig. 7. — Comparison of galaxies with similar (U — V) re st color 
but different (V — J) res t color. The top row shows the galaxies with 
blue (V — J)rest colors, and the bottom row shows galaxies with 
matching (U — V) res t color but much redder (V — J) res t color. The 
systematic difference in the SEDs of the two rows is striking. Fits 
indicate old bursts of star formation with little dust in the top row, 
and dusty young galaxies in the bottom row. This demonstrates 
the power of (V — J) r est in separating these classes. Note that the 
U-band photometry for objects #224 and #249 deviates by more 
than 2tr from the predicted U-band flux of the best-fit template. 



ors. However, at a given optical color, a spread in optical 
to near-infrared colors is observed that is larger than for 
nearby galaxies. At (U— V) res t > 1 evolved galaxies with 
little dust extinction are found at the bluest (V — J) res t- 
Dusty young star-forming galaxies occupy the reddest 
(V — J)rest colors. This is once more illustrated by the 
SEDs of galaxies with (U — V) rest > 1 presented in Fig. 
[7J The upper row shows SEDs of objects at the blue 
side of the (V — J) re st color distribution. The panels be- 
low show SEDs of galaxies matched in (U — V) res t, but 
with comparatively redder {V — J) re st colors. The latter 
galaxies have comparatively younger ages and a larger 
dust content. Since this distinction could not be made 
on the basis of (U — V) res t color alone, the addition of 
IRAC 3.6 — 8/tm photometry to our U-to-K SEDs proves 
very valuable for the understanding of stellar populations 
at high redshift. 

We verified that no substantial changes occur to the 
rest-frame optical-to-NIR color distribution and its inter- 
pretation in terms of age and dust content of the galaxies 
when we derive photometric redshifts by running HY- 
PERZ with redshift as free parameter instead of using 
the algorithm developed by Rudnick et al. (2003, see 

5. CONSTRAINTS ON STELLAR POPULATION 
PROPERTIES AT 2 < Z < 3.5: AGE, DUST AND MASS 

We now proceed to analyse in more detail the con- 
straints that IRAC places on the stellar populations of 
the subsample of galaxies at 2 < z < 3.5 (75 galaxies). In 
particular we will focus on stellar mass, which likely plays 
a key role in galaxy evolution at all redshifts (e.g. Kauff- 
mann et al. 2003; Bundy et al. 2005; Drory et al. 2005; 
Rudnick ct al. 2006). Fortunately, estimates of stellar 
mass from modeling the broad-band SEDs are generally 
more robust than estimates of dust content and stellar 
age (Bell& de Jong 2001; Shapley et al. 2001; Papovich 
et al. 2001; Forster Schreiber et al. 2004). Neverthe- 
less, translating colors to mass-to-light ratios and subse- 
quently stellar masses requires a good understanding of 



the effects of age and dust. 

5.1. Predictions from stellar population synthesis models 

5.1.1. Wavelength dependence: optical vs near-infrared 

In its simplest form, the stellar mass of a galaxy can be 
estimated from one color (see e.g. Bell et al. 2001). To 
illustrate this process, we present the evolutionary track 
of a dust-free BC03 model in a M/Ly versus U — V di- 
agram (Fig. |SJ. Up to 2.5 Gyr after the onset of star 
formation (upper right corner of Fig. 0) the track rep- 
resents a 1-component population with a star formation 
history that is exponentially declining, with an e-folding 
timescale of 300 Myr. For most of the galaxies in our 
sample this was the best-fitting star formation history. 
For more extreme star formation histories such as an 
SSP or CSF the process of estimating M/L values follows 
similar arguments. The filled circles on Fig. [5] represent 
age steps of 100 Myr. As the stellar population ages, 
its V-band luminosity fades with only a small decrease 
in stellar mass from mass loss, moving the galaxy up in 
M/Ly. Simultaneously the U — V color reddens as the 
hot early-type stars with short lifetimes die. The dust 
vector indicating a reddening of Ay = 1 mag runs paral- 
lel to the age track of the 1-component model. Ironically, 
the mass estimate benefits greatly from this degeneracy 
between age and dust in the optical. Under the assump- 
tion of a monotonic star formation history (U — V) res t 
can uniquely be translated to M/Ly, regardless of the 
precise role of dust or age. Only a normalization with 
Ly is needed to derive the stellar mass. A similar rela- 
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Fig. 8. — Evolutionary track of a 2-component stellar population 
in the M/Ly versus U — V plane. Filled circles mark age steps 
of 100 Myr. Open circles represent 10 Myr age steps. The dust 
vector, indicating an extinction of Ay = 1 mag, lies parallel to 
the age track. The histogram represents the color distribution of 
galaxies at 2 < z < 3.5, with DRGs highlighted in red, LBGs 
in blue. The age track starts as an exponentially declining star 
formation model (t = 300 Myr, BC03). At 2.5 Gyr a new burst of 
star formation is introduced, lasting 100 Myr and contributing 20% 
to the mass. Translating U — V into M/Ly assuming 1-component 
models can lead to underestimates of M/Ly and thus stellar mass. 
The possible underestimate is largest for blue galaxies. 



tion was used by Rudnick et al. (2003) to translate the 
integrated (U — V) rest color of high redshift galaxies into 
a global M / Ly and stellar mass density . They found 
that the conversion to mass-to-light ratio is more robust 
from the (U — V) res t color than from the (U — B) rest or 
(B - V) rest color. 

What if the actual star formation history is more com- 
plex? What effect does it have on the derived stellar 
mass? There is ample evidence from local fossil records 
(e.g. Trager et al. 2000; Lancon et al. 2001; Freeman& 
Bland-Hawthorn 2002; Forster Schreiber et al. 2003; An- 
geretti et al. 2005) and high redshift studies (e.g. Pa- 
povich et al. 2001; Ferguson et al. 2002; Papovich et 
al. 2005) that galaxies of various types have complex 
and diverse star formation histories, often with multiple 
or recurrent episodes of intense star formation . Such 
a scenario is also predicted by cold dark matter models 
(e.g. Somerville, Primack,& Faber 2001; Nagamine et 
al. 2005; De Lucia et al. 2005). In order to address 
this question qualitatively, we consider the case of a 2- 
component population. At t = 2.5 Gyr we added a burst 
of star formation to the T300 model, lasting 100 Myr and 
contributing 20% to the mass. To follow the evolution 
of the 2-component population closely, we mark 10 Myr 
timcstcps with open circles. Over a timespan of only 10 
Myr the galaxy color shifts by 1.6 mag towards the blue, 
while the M/Ly value stays well above the 1-component 
M/Ly corresponding to that color. As the newly formed 
stars grow older, the galaxy moves towards the upper 
right corner of the diagram again. The offset of M/Ly 
with respect to the 1-component model is a decreasing 
function of U — V. This means that if a bursty star for- 
mation is mistakenly fit with a 1-component model the 
mass and mass-to-light ratio are underestimated more 
for blue than for red galaxies, confirming what Shapley 
et al. (2005) found for a sample of star-forming galaxies 
at z > 2. 

The histogram on the bottom of Fig. [S] indicates the 
(U — V) rest color distribution of galaxies in the HDFS 
at 2 < z < 3.5. The population of Lyman-break galaxies 
(LBGs) is marked in blue, Distant Red Galaxies (DRGs) 
in red. The possible underestimate in mass-to-light ratio 
and thus mass is largest for blue galaxies, up to a factor 
of 3 for (U — V) rest = 0.2, the bluest color reached by this 
2-component model. For DRGs only a modest amount 
of mass can be hidden under the glare of a young burst 
of star formation. The exact error that bursts cause de- 
pends on the form of the bursty star formation history 
(see e.g. Fig. 6 in Rudnick et al. 2003 for a different 
example) . 

We can now test whether rest-frame near-infrared pho- 
tometry, as provided by IRAC, improves the constraints 
on the SED-based stellar mass estimates of high redshift 
galaxies. Labbe et al. (2005) found that the range in 
M/Lk for DRGs and LBGs together is as large as a fac- 
tor 6, meaning that a Spitzcr 8/im-selected sample would 
be very different from a mass-selected sample. However, 
if a similar relation between mass-to-light ratio and color 
exists in the rest-frame near-infrared as in the rest-frame 
optical, this does not mean that the stellar mass estimate 
is uncertain by the same amount (a factor of 6). Here 
we consider whether the mass-to-light ratio can robustly 
be derived from a given rest-frame near-infrared color. 
We discuss only the rest-frame J-band but note that the 
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results for rest-frame K are similar. In Fig. [5] we repeat 
the same exercise of drawing a M/L versus color evolu- 
tionary track for the rest-frame near-infrared. The burst 
that we superposed on the T300 model after 2.5 Gyr is 
again contributing 20% to the mass over a period of 100 
Myr. Note that the scale is identical to that of Fig. |H1 
The (V— J)rest histogram of sources at 2 < z < 3.5 is de- 
rived from observed near- to mid- infrared wavelengths. 
During the first Gyr, the V — J color hardly changes 
whereas M/Lj does by a factor of 7. As an immedi- 
ate consequence, the translation of V — J into M/Lj 
is highly uncertain for the blue galaxies in our sample 
and the additional IRAC observations do not improve 
the constraints on the mass-to-light ratio. The situation 
is further complicated by the effect of dust. V — J is 
a lot more sensitive to dust than M/Lj, illustrated by 
the dust vector of Ay — 1 mag. The effects of dust and 
age no longer conspire to give robust mass estimates at a 
given V — J color. At redder V — J the situation improves 
as the slope of the age track flattens. Here the inclusion 
of a rest-frame NIR color clearly reduces the uncertainty 
in stellar M/L that stems from the poor knowledge of the 
star formation history. The loop towards bluer colors is 
a magnitude smaller in size and we see no large offsets 
in M/L between the 1- and 2-component modeling. 

We have discussed the different behavior of dust and 
age in simplified 1- and 2-component models and have 
investigated the improvements expected from the inclu- 
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Fig. 9. — Evolutionary track of a 2-component stellar population 
in the M/Lj versus V — J plane. A T300 model from BC03 is 
shown. At 2.5 Gyr a 100 Myr burst is added , contributing 20% 
to the mass. Age marks represent 100 Myr (filled circles) and 10 
Myr (open circles) respectively. The histogram shows the color 
distribution of our sample at 2 < z < 3.5, with DRGs in red and 
LBGs in blue. For blue galaxies the V — J color is insensitive to 
M/L j, further complicated by the dust vector (Ay = 1 mag) that 
lies nearly orthogonal to the age track meaning blue galaxies can 
have a range of masses for the same V — J color. On the other hand, 
the introduction of a second burst only causes a small offset in 
M/L j from the single-component track, showing that the inclusion 
of a rest-frame NIR band reduces the uncertainties in stellar M/L 
caused by poor knowledge of the star formation history. 



sion of the rest-frame NIR with respect to the rest-frame 
optical. While additional rest-frame NIR data can lead 
to better M/L estimates, in particular for redder galax- 
ies (U — V > 1; V — J > 0.4), it is clear that we need to 
take advantage of the full U-to-8/im SED information to 
derive reliable estimates of stellar mass, stellar age and 
dust content. 

5.1.2. Model dependence: Bruzual & Chariot vs 
Maraston 

It is important to note that different stellar popula- 
tion synthesis models do not paint a consistent picture 
of evolution in the rest-frame near-infrared. To illustrate, 
we compare Bruzual & Chariot (2003; BC03) models to 
Maraston (2005; M05) models under the same assump- 
tion of Salpeter initial mass function and solar metallic- 
ity. 

Whereas the age track in a M/ Ly versus U — V di- 
agram behaves similarly for M05 and BC03, the near- 
infrared evolution of a T300 model looks very different 
(see Fig. ■ The grey dashed line represents the age 
track of a BC03 r 30 o model with superposed burst at 2.5 
Gyr as described in M5.1.1I In black we overplot the age 
track of a 2-component model with identical parameters 
by M05. In the 0.2 — 2 Gyr age range the two models 
look strikingly different. At the same V — J color the 
M05 model predicts M/Lj values that are up to a factor 
2.5 smaller than those of the BC03 model. The offset 
between M/Lj as predicted from 1- and 2-component 
modeling is also larger by a similar factor. The BC03 
and M05 models differ in several aspects: the stellar evo- 
lutionary tracks adopted to construct the isochrones, the 
synthesis technique and the treatment of the thermally 
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Fig. 10. — Evolutionary track of 2-component stellar populations 
in the M/Lj vs V — J plane based on BC03 (grey dashed line) and 
Maraston (2005, M05, black solid line) models. For ages between 
0.2 and 2 Gyr, the M05 model predicts much lower M/Lj values 
than the BC03 model. The underestimate of M/ Lj as derived from 
1-component modeling is therefore much more severe for the M05 
model than for the BC03 model, and the inclusion of rest-frame 
NIR data does not necessarily improve constraints on stellar M/L. 
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pulsating Asymptotic Giant Branch (TP-AGB) phase. 
The Padova stellar tracks (Fagotto et al. 1994) used by 
BC03 include a certain amount of convective-core over- 
shooting whereas the Frascati tracks (Cassisi et al. 1997) 
do not. The two stellar evolutionary models also differ 
for the temperature distribution of the red giant branch 
phase. The higher near-infrared luminosity originates 
mainly from a different implementation of the Thermally 
Pulsating Asymptotic Giant Branch (TP-AGB) phase 
(M05). Following the fuel consumption approach, M05 
finds that this phase in stellar evolution has a substantial 
impact on the near-infrared luminosity at ages between 
0.2 and 2 Gyr. BC03 follow the isochrone synthesis ap- 
proach, characterizing properties of the stellar popula- 
tion per mass bin. The latter method leads to smaller 
luminosity contributions by TP-AGB stars. We refer to 
recent studies from Maraston et al. (2005), van der Wei 
et al. (2006) and Maraston et al. (2006) for discussions 
of the model differences in greater detail. 

For our purpose it is sufficient to state that a given 
V — J color corresponds to younger ages, lower mass-to- 
light ratios and thus lower masses for the M05 model than 
for the BC03 model. Most importantly, we note that for 
M05 models inclusion of NIR data does not reduce stellar 
mass uncertainties caused by the unknown star formation 
history. 

5.2. Constraints on mass, dust and age from modeling 
our observed galaxies 

5.2.1. Wavelength dependence: optical vs near-infrared 

Having investigated the qualitative relationship be- 
tween M/L and the rest-frame optical-to-NIR color in 
H5.1.1I we now quantify the effect of inclusion of IRAC 
MIR photometry on the stellar population constraints 
of galaxies at 2 < z < 3.5. Our goal is to investigate 
whether and how the addition of IRAC imaging changes 
our best estimate of the stellar population properties and 
their confidence intervals. 

First we compare the distribution of stellar mass, dust 
content and mass- weighted stellar age as fit with or with- 
out IRAC. The upper row of Fig.llllshows a direct com- 
parison of the inferred model parameters with or without 
IRAC photometry for all galaxies at 2 < z < 3.5. The 
filled histogram in the lower row of Fig.llllshows the dis- 
tribution of mass, dust extinction and age derived from 
the full U-to-8^m SED. The dotted line indicates the 
distribution of best-fitting parameters from modeling the 
U-to-K photometry. Both the median and the width of 
the distribution stays the same for all three parameters. 
Defining the difference between mass, mass- weighted age, 
and Ay as Alog(M) = log{M withIRAC ) -log(M noIRAC ), 
A Ay = Av.yjithiRAc ~ Ay noIRAC , and Alog(age w ) = 
\og(age w , wit hin.Ac) ~ \og(age w , noIRA c) we find a me- 
dian and normalized median absolute deviation (equal 
to the rms for a gaussian distribution) [x, a n m ad{x)) of 
(-0.007 ±0.009, 0.07), (0.00 ± 0.03, 0.30), and (0.00 ± 
0.02, 0.16) respectively. The average and standard devi- 
ation ((.t); er(x)) of Alog(M), AAy and A\og(age w ) are 
(-0.04 ±0.02; 0.13), (-0.08 ± 0.04; 0.36) and (-0.02 ± 
0.03; 0.28) respectively. Thus the differences for the 
galaxy sample as a whole after including IRAC are very 
small. The results for stellar mass are similar to what 
Shapley et al. (2005) found for a more specific sample of 



optically selected star- forming galaxies at z ~ 2. 

Having determined that the overall distribution of 
best-fit age, dust content and stellar mass does not 
change after including IRAC, the question remains 
whether IRAC helps to improve the constraints on the 
stellar population characeristics for individual galax- 
ies. We address this question using the measure 
CnoiRAC /o~withiRAC, defined as the ratio of confidence 
intervals without and with IRAC. The la confidence in- 
tervals, representing random uncertainties propagating 
from photometric errors, are derived from Monte Carlo 
simulations. For each galaxy SED we create 100 mock 
SEDs where the flux-point in each band is randomly 
drawn from a Gaussian with the measured flux as the 
mean, and its error as the standard deviation. Next, 
each SED was fit with the same fitting procedure as the 
observed version. As we want to isolate the effect of in- 
cluding IRAC observations on the confidence intervals we 
fix the redshift to z p hot (or z spec where available). In cal- 
culating o-noiRAc/vwithiRAC we measure the confidence 
interval in log-space for stellar mass and mass-weighted 
age and in magnitude for Ay. Furthermore we set a 
lower limit to the confidence intervals to account for the 
discreteness of our models, i.e. age and Ay steps. 

Fig. shows the values of a no i R Ac / 'vwithiRAC for 
mass, age and dust content as a function of rest-frame 
(U — V) rest color for the galaxies at 2 < z < 3.5. We 
divide the sample into a blue and red bin and indicate 
the median reduction of confidence intervals for each bin 
with dashed lines. The separation between blue and red 
is chosen to be (U — V) res t — 1, corresponding to the 
oberved (J — K)y ega > 2.3 color cut for Distant Red 
Galaxies at the median redshift of our sample z = 2.66. 
We find that the typical improvement of confidence inter- 
vals is dependent on galaxy color for all considered stellar 
population parameters. For red galaxies, the reduction 
amounts to a median factor of 2.7, 1.7 and 2.9 in the case 
of stellar mass, Ay and age respectively. For blue galax- 
ies the reduction of the mass confidence interval is only a 
factor 1.3, though with a large scatter, while for Ay and 
age no median reduction is found. With the color tracks 
of the stellar population models in mind (see Fig. I8I9|I 
this color dependence should come as no surprise. We 
demonstrated in H5.1.1I that for blue galaxies optical to 
near-infrared colors are degenerate with the mass-to-light 
ratio. Hence, the IRAC bands of blue galaxies contribute 
little information about their mass. 

For a sample of (generally blue) optically selected star- 
forming galaxies at z ~ 2 Shapley et al. (2005) found a 
reduction in stellar mass uncertainties by a factor 1.5 — 2 
due to the addition of IRAC photometry, which seems 
like a contradiction. However, the distribution of ob- 
served R— K color of their galaxies extends towards red- 
der colors than the Lyman-break galaxies (LBGs) in our 
sample, which may partly explain the larger improve- 
ment than we find for blue LBGs. Another important 
difference is that Shapley et al. (2005) lacked J and H 
images and hence did not probe rest-frame U—B or U—V 
for their galaxies. It is possible that the lack of the near 
infrared J and H bands in Shapley et al. (2005) is the 
main reason for the discrepancy. We simulated this ef- 
fect by omitting J and H and repeating the Monte Carlo 
simulations with and without IRAC. The median reduc- 
tion of the ler mass confidence interval now increases to 



11 



S 
2 



< 

o 
c 



o 



12.0 
1 1.5 
1 1.0 
10.5 
10.0 

9.5 
9.0 



3.0 ■ 
2.5 ■ 



o 2.0 ■ 
< 



9.0 9.5 10.0 10.5 1 1.0 1 1.5 12.0 
log(M. with IRAC) [M ] 



30 



20 



10 



with IRAC 
no IRAC 



0L: 

9.0 



9.5 10.0 10.5 1 1.0 1 1.5 12.0 
log(M.) [M ] 



1.5 ■ 



* i-oh — 1 

0.5 



0.0 



FT 



m 0.5 ■ 
0.0 ■ 
-0.5 ■ 



o 
c 

1 £ -1.01= 

o 



CP 

o 



-1.5 ■ 




0.0 0.5 1.0 1.5 2.0 2.5 3.0 
\ with IRAC 



-1.5 -1.0 -0.5 0.0 0.5 
log(age„ with IRAC) [Gyr] 





30 ■ 



20 ■ 



10 ■ 



-1.5 -1.0 -0.5 0.0 0.5 
!og(oge w ) [Gyr] 



Fig. 11. — Upper row: Comparison of best-fit stellar masses, dust extinctions and mass- weighted ages for galaxies at 2 < z < 3.5 
when fit with IRAC photometry or without. The error bars are based on Monte Carlo simulations given the photometric errors. Lower 
row: Corresponding histograms with IRAC photometry (filled) or without (dashed). No significant change in the overall distributions is 
observed, but the best-fit properties of individual galaxies may change substantially. 
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Fig. 12. — Tightening of the confidence interval around best-fit stellar mass, age and dust extinction as a function of rest-frame U — V 
color for galaxies at 2 < z < 3.5. The median improvement after including the IRAC photometry is a factor 2.7 for red galaxies (dashed 
red line), significantly larger than the factor 1.3 for blue galaxies (dashed blue line). A similar color-dependence is found for constraints on 
age and dust extinction. 



a factor 1.9 when including IRAC. 

We conclude that, in the presence of very deep ob- 
served J, H and K photometry, inclusion of mid- infrared 
data places little extra constraints on the stellar popula- 



tions of blue galaxies. However, for galaxies redder than 
(U — V) res t = 1, IRAC reduces the confidence interval 
by a substantial factor 2.5 — 3. 



12 




1.0 1.5 2.0 2.5 3.0 3.5 



cn 
O 



0.0 0.5 1.0 1.5 2.0 

(u-v) rest 




-1.0L 



10 11 
log(M.) [M ] 



12 



Fig. 13. — Difference between best-fit stellar mass as derived from BC03 and M05 models as a function of redshift, (U — V)rest color 
and BC03 stellar mass for galaxies with Ly > 5 X 1O 9 L0 at 1 < z < 2 (open symbols) and 2 < z < 3.5 (filled symbols). The stellar masses 
derived from BC03 models are systematically higher than those derived from M05 models by a factor 1.4. The scatter in \og{M BC03 1 'M mos) 
is 0.1 in dcx. No significant dependence of log(Af BCO'A 1 M mos) on redshift, (U — V) r est color or stellar mass is found. The bias introduced 
by the choice of stellar population synthesis model amounts to a maximum of 15% over the whole (U — V) re st color or stellar mass range 
of our sample. 



5.2.2. Model dependence: Bruzual & Chariot vs 
Maraston 

In H5. 1.21 we pointed out strong differences in the rest- 
frame optical-to-NIR colors between the BC03 and M05 
models. In this paragraph we quantify how our results 
change if we use M05 models. The median and nor- 
malized median absolute deviation, average and stan- 
dard deviation of the differences between BC03 fits 
and M05 fits (Alog(M) B C03_M05, ^v,BC03JH05 and 
A log(ageu,)sco3_M05) are summarized in Table 
As expected, the BC03 models predict older ages and 
thus higher stellar masses than the M05 models for our 
z = 2 — 3.5 galaxies. The estimated mass for the M05 
models is systematically lower by a factor 1.4. Maraston 
et al. (2006) found a similar discrepancy for a sample of 
7 galaxies in the Hubble Ultra Deep Field that satisfy the 
BzK criterion (Daddi et al. 2004) for z > 1.4 passively 
evolving galaxies. Apart from a systematic shift a scatter 
of 0.1 in dex is found in A log(M) bco3_mo5; meaning the 
choice of stellar population synthesis model introduces a 
considerable systematic uncertainty. 

It is of great importance to test whether 

A log(M) B C03_A/05 = log(Af BC 03) ~ k>g(M M 05) 
correlates with redshift, color or stellar mass, since 
such dependencies, if present, could bias studies of 
galaxy evolution or trends with mass. In Fig. 1131 we 
plot A log(M)sC03_M05 versus redshift, (U - V) rest 
color, and stellar mass (the latter derived from BC03 
models). We show galaxies with Ly > 5 x 10 9 L Q at 
1 < z < 2 (open symbols) and at 2 < z < 3.5 (filled 
symbols); no evidence for a redshift dependence is found. 
For the (U — V) res t (middle) and stellar mass (right) 
panel, the p-values for statistical significance from the 
Spearman rank order correlation test are also larger 
than 0.05, meaning no significant correlation is found. 
Fitting a line to the points in the A log(M)sco3_M05 
versus (U — V) rest diagram, a difference of 0.06 dex in 
Alog(M)sco3_M05 is found over the 2 mag range in 
(U — V) res t color spanned by the galaxies in our sample. 
Even if a trend of increasing A log(M) bco3_M05 with 



redder (U — V) rest color is real, it only introduces a 
small bias of the order of 15%. A similar conclusion can 
be drawn for the dependence on stellar mass. 

5.2.3. Metallicity dependence 

We test how variations from solar metallicity affect 
the estimates of stellar mass, mass-weighted age and 
dust extinction. We study the effect of a different metal 
abundance by fitting BC03 templates with metallicity 
Z = 0.2 Zq to the observed SEDs, leaving the extinction 
law to Calzetti et al. (2000). NIR spectroscopy of DRGs 
(van Dokkum et al. 2004) and LBGs (Erb et al. 2006a) 
indicates that a range of Z = 0.2 — 1 Zq is appropriate 
for galaxies at 2 < z < 3.5. Furthermore, at metallicities 
below Z — 0.2 Zq the tracks and spectral libraries used 
to build the BC03 templates become more uncertain by 
lack of observational constraints. Decreasing the metal- 
licity from Z = Zq to Z = 0.2 Zq lowers the estimated 
stellar masses of galaxies at 2 < z < 3.5 by 0.1 dex, 
leads to a mass- weighted age that is typically lower by 0.2 
dex, and is compensated by an average increase in Ay of 
0.2 mag. The fact that age estimates are more strongly 
affected than estimates of stellar mass when changing 
the assumed metallicity was demonstrated in detail by 
Worthey (1994). While absolute values of ages and dust 
extinctions may be biased as just described, the relative 
age and dust trends within the galaxy population as dis- 
cussed in |2] based on the standard SED modeling (see 
Q are robust. 

5.2.4. Dependence on extinction law 

The Calzetti et al. (2000) extinction law was empiri- 
cally derived from observations of local starburst galax- 
ies. We quantify the variations in stellar population 
properties due to the adopted extinction law by compar- 
ing our modeling results with a Calzetti et al. (2000) law 
to those obtained with reddening laws from Fitzpatrick 
(1986) for the Large Magellanic Cloud (LMC) and Prevot 
et al. (1984) for the Small Magellanic Cloud (SMC), 
leaving the metallicity to solar. Stellar masses, mass- 
weighted ages and Ay values of galaxies at 2 < z < 3.5 
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Table 5. Differences between stellar population properties derived 
from BC03 and M05 



modian(A s co3_M05) &NMAD (AbC03_M05 ) mean(A s c03_M05) c(A.BC03-M05) 



Iog(M«) 0.14 ±0.01 0.06 0.15 ±0.01 0.08 
A v -0.20 ±0.00 0.00 -0.18 ±0.02 0.19 
log( Age w ) 0.29 ±0.02 017 0.34 ± 0.02 0.18 



derived with the LMC law models are similar to those ob- 
tained with the Calzetti et al. (2000) law. The SMC law, 
which rises more steeply towards shorter wavelengths 
in the near-UV, gives similar mass estimates, Ay val- 
ues that are on average smaller by 0.3 mag and mass- 
weighted stellar ages that are older by 0.23 dex, with the 
ages of the oldest galaxies being limited by the age of the 
universe constraint. As for metallicity, we conclude that 
using a different extinction law has a larger impact on 
the age estimates than on estimates of stellar mass. 

6. STELLAR MASS - OPTICAL COLOR RELATION 

In this section we study the relation between the rest- 
frame optical color of high rcdshift galaxies and their 
stellar mass. We start with a model-independent ap- 
proach, plotting rest-frame (U — V) res t versus rest-frame 
Jrest magnitude for all galaxies at 2 < z < 3.5 in Fig. 
1141 The emission of low mass long-lived stars that make 
up the bulk of the mass in a galaxy peaks in the rest- 
frame near-infrared. Jrest is therefore expected to be 
a reasonably good tracer of stellar mass. The galaxies 
that satisfy the DRG selection criterion (red symbols) 
are found at redder (U — V) res t than the Lyman-break 
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Fig. 14. — Rest-Frame U — V color versus absolute J magnitude 
for galaxies at 2 < z < 3.5. Lyman-break galaxies are plotted in 
blue. DRGs (red symbols) populate the red side of the U — V color 
distribution. Black symbols denote those objects that do not meet 
either criteria. The solid line marks the K-band selection of our 
sample. The dust vector indicates an extinction of Ay = 1 mag. 
The most luminous galaxies in the rest-frame near-infrared have 
redder rest-frame optical colors than fainter galaxies. 
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Fig. 15. — Rest-Frame U — V color versus stellar mass for galaxies 
at 2 < z < 3.5. DRGs are marked in red, LBGs in blue. The K- 
band selection of our sample is indicated by the solid line. The dust 
vector indicates an extinction of Ay = 1 mag. Low-mass galaxies 
with red colors might exist, but would not enter the sample. The 
most massive galaxies have redder U — V colors than less massive 
galaxies; notice the striking absence of massive blue galaxies. 



galaxies (blue symbols). The reddest (U — V) rest col- 
ors are found at the brightest J rest magnitudes. Note 
however that the observed trend is partially driven by 
the K-band selection of our sample. The line on Fig. 
1141 indicates at which magnitude a galaxy with identi- 
cal colors to our observed galaxies would fall out of the 
sample. Even if we only consider galaxies brighter than 
the limiting J res t = —21.5 to which we are complete over 
the whole (U — V) res t color range, we find that galaxies 
redder than (U — V) rest — 1 are 1 mag brighter than 
galaxies with (U — V) rest < 1, significant at the 3er level. 
Studying a sample without color bias (as advocated by 
van Dokkum et al. 2006) proves crucial to pick up the 
trend of (U — V) r est with J res t- We note that Meurer et 
al. (1999) found that LBGs with higher rest-frame UV 
luminosities tend to have redder rest-frame UV colors, 
illustrating that, while trends of color with luminosity 
are most notable in samples without color bias, they are 
still present in at least some color selected samples. 

If Jrest is a reasonable tracer of stellar mass, we expect 
to see a similar or stronger trend of (U — V) res t with 
the stellar mass. This is shown in Fig. ^| The plotted 
mass is derived from 1-component SED modeling of the 



14 



U-to-8/itm SED as described in 321 The typical error 
bar is indicated in the lower left corner. The depth of 
our K detection band allows us to probe stellar masses 
from 3 x 10 11 M Q down to 2 x 10 9 M Q . A correlation 
of (U — V) res t with stellar mass is clearly visible. The 
most massive galaxies have a red optical color. Ly man- 
break galaxies and other blue galaxies at high redshift 
contain typically 5 times less stellar mass than the DRGs 
in our sample. Again the K-band selection of our sample 
(solid line) limits our ability to detect faint red galaxies. 
Therefore we can not exclude the presence of low-mass 
red galaxies. The lack of massive blue galaxies seems 
to be real. Rigopoulou et al. (2006) find a co-moving 
density of $ = (1.6 ± 0.5) x 10^ Mpc~ 3 for LBGs with 
M > 1O 11 M0 at an average redshift (z) ~ 3, consistent 
with the absence of such massive but rare LBGs in our 
sample. 

However, the lack of massive blue galaxies could be 
an artifact of our choice of simple star formation histo- 
ries. As demonstrated in ^IB .1.11 a severe underestimate 
of the stellar mass is possible when the true star forma- 
tion history is more complex than that of the modeled 
1-component stellar population. When a young burst of 
star formation is superposed on a maximally old popu- 
lation, its blue light will dominate the (U — V) res t color 
and the mass from the underlying population will be hid- 
den. In order to constrain the possible underestimate in 
mass, we fit 2-component models to our SEDs. Erb et 
al. (2006b) describe a procedure to achieve this in two 
steps, where first a maximally old population is fit to the 
K(+IRAC) data and subsequently a young population is 
fit to the (primarily UV) residual. However, this proce- 
dure does not guarantee a good fit in the x 2 sense. In- 
stead, we decided to perform a simultaneous fit of both 
old and young components. We constructed template 
SEDs consisting of a maximally old single stellar popu- 
lation with a recent burst of star formation that started 
100 Myr ago and lasted till the moment of observation 
superposed. We made templates where the mass frac- 
tion created in the burst is 2 X with x going from -6 to 
2 in steps of 1. We assume that the same reddening by 
dust applies to the old and the young population, with 
Ay ranging from to 3 in steps of 0.2. Without this as- 
sumption, one could in principle hide an infinite amount 
of mass in an old population as long as an optically thick 
medium is shielding it from our sight. However, such 
a scenario is physically implausible. Since we are inter- 
ested in an upper limit on the mass, as opposed to the 
most likely value, we do not search for the least-squares 
solution over all x. Instead we perform the fit for every 
burst fraction and select the highest mass that still has 

^^red ^ red, 2 — component Xred,min,l — component ^ ^" 

Fitting the 2-component models to the U-to-8/Ltm SEDs 
of our galaxy sample at 2 < z < 3.5, we indeed see 
that a higher stellar mass is allowed when more com- 
plex star formation histories are adopted fFig. I16|) . The 
upper bound on stellar mass that we derive from this 
particular 2-burst model is in the median a factor 1.7 
higher than the 1-component estimate for galaxies red- 
der than ([/ — V) res t = 1. For blue galaxies the me- 
dian increase is a factor 2.1. Despite the fact that more 
mass can be hidden in blue galaxies, a trend of optical 
color with stellar mass remains visible. We performed a 
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Fig. 16. — Rest-Frame U — V color versus stellar mass for 
galaxies at 2 < z < 3.5 with complex star formation histo- 
ries. For each object a vector starts at the best-fit mass from 
1-component SED modeling. The vector ends at an upper limit 
for the stellar mass as obtained by fitting 2-component models. 
The 2-component model is composed of a maximally old stellar 
population with a second burst of star formation during the last 
100 Myr superposed. Empty symbols refer to objects for which 

Xred,2- component ~ xled,l- component > 2 for a11 Considered burst 

fractions. The mass that is plotted here corresponds to the burst- 
fraction that gave the lowest X re d- The typical amount of mass 
that can be hidden under the glare of a young secondary burst 
is on average larger for blue than for red galaxies. Nevertheless, 
even allowing for more complex star formation histories than 1- 
component models a lack of massive blue galaxies remains visible. 



Mann- Whitney U-test to compare the (U — V) res t colors 
of galaxies with different stellar mass. We conservatively 
adopted the 1-component stellar mass for galaxies with 
(U — V) res t > 1 and the 2-componcnt upper limit for ob- 
jects with (U — V) res t < 1. To avoid selection effects we 
only consider galaxies more massive than M = 10 10 Mq. 
Dividing them in two mass bins with an equal number 
of objects the Mann- Whitney U-test (Walpole & Myers 
1985) confirms at a 99% significance level that the mean 
of the (U — V) res t distributions differs. Applying the 
same 2-component models to the U-to-K SEDs (omitting 
IRAC), the median upper mass estimate increases to a 
factor 2.3 above the 1-component estimate for red ob- 
jects and a factor 3.7 for blue objects. We conclude that, 
as expected from H5.1.1I more mass can be hidden in blue 
than in red galaxies, but this effect is insufficient to re- 
move the trend of stellar mass with color. Furthermore, 
the amount of mass that can be hidden is constrained by 
addition of IRAC photometry. 

The color dependence that we derive for the amount of 
mass that can be hidden in an underlying old population 
confirms findings from Shapley et al. (2005) based on a 
sample of star- forming galaxies at z ~ 2. The predom- 
inance of distant red galaxies at the high-mass end was 
illustrated recently by van Dokkum et al. (2006) using 
a mass-selected sample of galaxies at 2 < z < 3 with 
M > 10 11 Mq. Only with very deep imaging such as 
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that of the HDFS analyzed in this paper it is possible 
to probe down to lower masses and prove that the most 
massive galaxies have red (U — V) rest colors compared 
to lower mass galaxies. 

7. SUMMARY 

We investigated the rest-frame optical to near-infrared 
color distribution of galaxies up to 2 < z < 3.5 in the 
Hubble Deep Field South. At all redshifts, galaxies with 
redder (U — V) res t tend to have redder (V — J) T est, as 
is the case in the local universe. At (U — V) rest colors 
comparable to that of local galaxies, the color distribu- 
tion of distant galaxies extends to redder (V — J) res t- 
At (U — V) res t > 1 the population of galaxies at the 
red (V — J)rest end is well described by dust-enshrouded 
star-forming models, whereas galaxies with (V — J) r est 
similar to that of local galaxies are consistent with old 
passively evolving systems. We conclude that (U— V) res t 
alone allows us to isolate blue relatively unobscured star- 
forming galaxies, but addition of ( V — J) r est is necessary 
to distinguish young dusty from old passively evolved 
systems. At redshifts above z = 1, this means IRAC ob- 
servations are crucial in understanding the wide variety 
in stellar populations. We note that our analysis is not 
subject to uncertainties due to field-to-field variations, 
but surveys over much larger areas are needed to study 
the relative contributions of galaxies with different stellar 
populations. 

We analyzed the constraints that IRAC places on stel- 
lar mass, stellar age and dust content of galaxies at 
2 < z < 3.5. No evidence is found for systematic offsets 
when determining the stellar population characteristics 



with or without IRAC. However, the ratio of confidence 
intervals on stellar mass, mass- weighted age and dust ex- 
tinction is typically reduced by a factor 2.7, 2.9 and 1.7 
respectively for red ((U — V) rest > 1) galaxies. In gen- 
eral, IRAC does not provide stronger constraints for blue 
galaxies ((U — V) res t < 1) when very deep near-infrared 
imaging is available (as is the case for the HDFS). 

We caution that, in characterizing the stellar popula- 
tions using M05 models, we find stellar masses that are 
typically a factor 1.4 lower than for BC03 models with a 
scatter of 0.1 in dex. 

A trend of brighter J res t with redder (U — V) res t is 
observed for galaxies at 2 < z < 3.5, where the near- 
infrared luminosity serves as a (imperfect but model- 
independent) tracer for stellar mass. Plotting (U— V) res t 
versus modeled stellar mass, we arrive at a similar con- 
clusion: the most massive galaxies in our sample have red 
rest-frame optical colors. A possible concern is that this 
trend with mass is caused by our simplistic choice of star 
formation histories. When we allow for more complex 
star formation histories, more mass can be hidden than 
in the case of a 1-component stellar population and the 
amount depends on the color of the galaxy. We used 2- 
component stellar populations, consisting of a maximally 
old population with a young population superposed, to 
set an upper bound on the stellar mass present. Even 
though relatively more mass can be hidden in blue galax- 
ies compared to red galaxies, under the assumption of an 
equal dust reddening of the young and old component, 
a trend of (U — V) res t increasing to redder colors with 
stellar mass remains visible. 
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